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Abstract 

In the classical theorems of extreme value theory the limits of suit- 
ably rescaled maxima of sequences of independent, identically distributed 
random variables are studied. So far, only affine rescalings have been 
considered. We show, however, that more general rescalings are natural 
and lead to new limit distributions, apart from the Gumbel, Wcibull, 
and Frechet families. The problem is approached using the language of 
Renormalization Group transformations in the space of probability den- 
sities. The limit distributions are fixed points of the transformation and 
the study of the differential around them allows a local analysis of the 
domains of attraction and the computation of finite-size corrections. 
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1 Introduction 



The basic problem of Extreme Value Theory (EVT) is the following (see 
Ref. [T] for a primer). Given a equence of n independent identically dis- 
tributed (i.i.d.) random variables, we ask how the properly rescaled maxima 
of the sequence are distributed when n — > oo. Not surprisingly, EVT has 
much importance from the point of view of applications in the natural sci- 
ences El HI [3] , finance [UJ, and engineering [7j, to name a few. In all these 
fields one often encounters problems possessing a threshold value for some 
quantity and wants to know the probability that it be exceeded (catastrophic 
events are a good illustrative example). This question is similar in spirit to 
that answered by the central limit theorem, which deals with the limits of 
rescaled sums of i.i.d. centered random variables. In both cases one tries to 
find out whether some kind of universality exists, so that the family of limit 
distributions is small and their domains of attraction are easy to describe. 

The problems of EVT and the central limit theorem are naturally addressed 
in the framework of the Renormalization Group (RG), the deepest formalism 
used in modern Physics to understand how a system behaves under a change 
of the scale of observation. For a treatment of the central limit theorem results 
and stable distributions in this setup see Refs. [8j [9]. Only recently has EVT 
been tackled from the perspective of the Renormalization Group [10LI1H [T2] . In 
the latter references, the main motivation was to advance the understanding 
of the convergence to the limit when the size of the sample data increases. 
Herein, we employ the RG language to try to discuss and solve a different 
fundamental problem on the acceptable rescalings and limits of maxima of 
sequences of i.i.d. random variables. We describe it next. 

Let p be a probability density in E and p its distribution function, 



Then, the distribution function for the maximum value of n i.i.d. random 
variables with probability density p is given by 



In the limit of large n, P n concentrates around the maximum of the support 
of p. It is not surprising that in order to obtain a non-trivial limit we have 
to rescale the random variable. To the knowledge of the authors, the only 




M n {x) = p{x) 



n 



and the corresponding probability density reads 



Pn(x) = —M n (x). 
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rescaling thoroughly studied so far is the affine one: starting from Frechet [13] 
and Fisher and Tippet [13] there has been an extensive literature considering 
the possible limits of P n (a n x + b n ) and the domains of attraction. Actually, 
Frechet only considered the case b n = while Fisher and Tippet gave the 
expression for the possible limit distributions with full generality. Finally 
Gnedenko [15] completed the solution of the problem by describing rigorously 
the domain of attraction of the different limit distributions. But a natural 
question is: why to admit only affine rescalings? At this point, nothing better 
than quoting from [16j : 

An interesting side issue is why this formulation was adopted at all 
with its affine normalization of M n . Fisher and Tippet did not explain 
this, whereas Gnedenko offered only the analogy with stable distribution 
theory for sums, which seems to be begging the question. Perhaps the real 
explanation is that no one came with an alternative formulation that lead 
to interesting results. The same explanation is still valid today. 

In this work we try to fill this gap and explain that there are, indeed, interesting 
results beyond the affine rescaling. 

The rest of the paper is organized as follows. In Section [2] we recast the 
problem of EVT into the RG formalism and show that when the restriction 
of affine rescalings is relaxed new interesting limit distributions (fixed points 
in the RG approach) appear, apart from the Gumbel, Weibull, and Frechet 
families. In Section[3]the domains of attraction of the fixed points are studied. 
Section [3] is devoted to finite-size corrections, i.e. the modification of the limit 
distributions when the sample size, n, is large but finite. In Section [5] some 
illustrative numerical tests are given. Finally, the conclusions are presented in 
Section [6J 

2 Renormalization Group transformation 

Define the RG transformation 

r s/ i(x):=/i( 5s (x)) re , (2.1) 

where, for reasons that will become evident later, we use s = logra to pa- 
rameterize it. Note that the definition of T s requires the choice of a rescaling 
function g s . To the date only the case in which g s is an affine transformation 
has been studied. 

In terms of the probability density p the transformation (|2.ip reads 

T sP {x) = g' s {x)nn{g s {x)) n - l p{g s {x)). 
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Considering that the support of P n is equal to the support of p it is natural 
to ask that g s be a homeomorphism from the support of p onto itself, in this 
way T s preserves the support of the probability density we start with. We 
remark that this condition is not imposed in the works focusing on the affine 
rescaling. 

The transformation T s can be extended to continuous s once the appro- 
priate g s is defined. A natural requirement for the transformation T s is that 
it forms a uniparametric group, i.e. 

T 8l o T S2 = T Sl J rS2 . 

Given the choice of parameterization, this holds provided that 

9s 2 og sl = g Sl +s 2 , (2.2) 

and from now on we assume that this is the case. If we take g s differentiable 
with respect to s, condition (|2.2p is equivalent to saying that g s is solution of 
the differential equation 

^9s(x) = f{g s (x)), 
with initial condition go(x) = x and 

f( x ) = 4~9s(x) . 

OS s=0 

We are actually interested in the possible extreme limiting distributions, 
i.e. in M = lim s _ > . 00 T s p. This, together with the continuity of T s , implies that 
M must be a fixed point of the renormalization group transformation, 

M{g s {x)) n = M{x), with n = e s . (2.3) 

Assume that this equation has a solution with probability density P{x) = 
dM(x)/dx whose support is denoted by E. Several important consequences 
follow. 

(i) For s > and x in the interior of S, g s (x) > x or equivalently f(x) > 0. 

Proof: If x is in the interior of X the distribution function verifies M(x) £ 
(0, 1) and is monotonically increasing. We have n = e s > 1 and, there- 
fore, M(x) > M{x) n and if (fSTHj) holds we must have M(g s (x)) > M(x), 
which implies g s (x) > x and consequently f(x) > (f(x) = implies 
g s (x) = x). 

(ii) If x* is at the boundary of E then fix*) = 0. 

Proof: A simple consequence of the fact that g s is a differentiable, uni- 
parametric group of homeomorphisms of E and therefore we must have 
g s {x*) = x*. 
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(iii) f(x*) = if and only if x* is the maximum or the minimum of £. 

Proof: f(x*) = implies g s (x*) = x* and if fl£3) holds M(x*) n = M(x*). 
But this is possible only if M(x*) = (x* minimum of £) or M{x*) = 1 
(x* maximum of £). The converse is contained in (ii). 

(iv) / can have at most two ceros and the boundary of £ at most two points. 
Therefore we have three possibilities: £ is the the real line (—00,00), 
the semi-infinite line [a, 00) or (—00,6], or the closed interval [a, b]. 

For any of the three cases mentioned in (iv) we shall take a group of maps 
that preserve £ and study the associated RG flow. 

• Case 0: £ = (—00,00). 

In this case a natural and simple choice for the group of maps is the 
group of translations, i.e. 



The most general limiting distributions (or fixed points of the renormal- 
ization group) for this transformation is 



g s (x) = x + - 



a > 0. 



a 



M = e 



-\e- a: 



A > 0. 



• Case 1~: £ = (-00, 0]. 

The simplest choice for g s is, in this case, 



9s{x) = e 



s/a 



X 



a > 0. 



And the corresponding limiting distribution is 



Mr = e - A (- x) 



A > 0. 



• Case 1+: £ = [0,oo)]. 

The maps that preserve the semi-infinite line are 



g s (x) = e s/a x, a > 0, 



and the limiting distributions 




A > 0. 
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• Case 2: £ = [0,1]. 

A simple choice for the uniparametric group of maps is 

g s (x) = x e s/ ", a > 0, 

that leads to the following family of limiting distributions 

M 2 (x) = e~ x( - losx)a , A>0. 

Note that in all cases two free positive constants a and A appear, whose 
role is easy to understand: a fixes the scale for the group parameter s and A 
can be changed into Ae s by the action of the group of maps, that transform 
a fixed point of the RG into another one. The fixed points of Cases 0, 1~ 
and 1 + are well known in the literature and comprise the so-called Gumbel, 
Weibull, and Frechet distributions. While in Cases 0, 1 + and 1~ the rescaling 
is affine, it is not so in case 2. The limit distributions of Case 2 are new and 
the subject of our analysis. 

3 Limit distributions with compact support 

Let us consider the RG transformation (|2.1|) for 

g s (x) = x*- s/a , (3.1) 

where a is a positive real number. That is, we concentrate on Case 2 from 
Section [2j Note in passing that the case a = 1 contains some interesting 
distributions among the possible fixed points. When a = 1 the general fixed 
point of the transformation is 

M(x) = x x , A > 0. 

Therefore 

P{x) = Ax A ~ x 

and, if A = 1, we get the uniform distribution. 

It is easy to determine the domain of attraction of a given fixed point when 
g s is of the form (|3.ip . We have the following result: 

Proposition. A given random variable supported in [0, 1] with cumulative 
probability distribution \i converges weakly (or in law) after succesive appli- 
cations of the RG transformation T s to M(x) = e -A (~ logx ) a , i.e. 

lim T s fi(x) = M(x) for all x £ [0, 1], 



G 



if and only if 

lim = A. (3.2) 

^i(-logx) Q v ; 

Proof: If (|3.2p holds, then we can write 

= e -A(-loga;) Q +o((-logx)") j 

and therefore 

T s fl(x) = H(x n ~ 1/a ) n = e -A(-logx)-+no(i(-logxO-) ) (3 _ 3) 

where n = e s . The large s limit in the expression above yields 

lim T s /j,(x) = M(x) for every x S [0, 1]. 

s— >oo 

To prove the converse note that the convergence of T s /i, taking logarithms 
and for x 7^ 0, 1, can be expressed as 

-log / u(x n ~ 1/a ) 
lim n- 



n->-oo (—log a;)" 
or equivalently 

— logu(x n 1/c 

lim 



n^oo (_ \ g( x n 1/a ))« 

But given that if Owe have linin^oo x n =1, therefore 

lim 7 l0g ^ W = A. 
(— log x) a 



□ 



We should insist that the novelty of these fixed points and attraction do- 
mains resides in the non- linear rescaling function g s , which in turn is motivated 
by the (natural) requirement that the rescaling preserves the support of the 
initial random variable. If we had considered the standard affine rescaling, the 
fixed points and their domains of attraction would have corresponded to the 
Weibull distributions with exponent a. 

In the next section we continue the study of the new fixed points with the 
analysis of the finite-size corrections. 
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4 Finite size corrections 



To discuss the amplitude of finite-size corrections and their shape, i.e. the be- 
havior of the extremal distributions when the number of i.i.d random variables 
n is large but finite, we must study the neighborhood of the fixed points and 
the linear approximation of the RG transformation (|2.ip . For this we compute 
its differential at a probability distribution /i acting on rj: 

(DT^ri = n^g^T-^igsix)). (4.1) 

The stable and unstable directions of a fixed point \x are given by the eigen- 
values and eigenvectors of (|4.ip at /i. They determine the amplitude of the 
finite-size corrections and their shape. 

We focus on the case g s (x) = x n 1/a , with n = e s and fixed point M{x) = 
e -A(- logz)"^ In order to solve the eigenvalue equation for (|4.1|) it is very useful 
to consider the following ansatz rj(x) = M(x)(p(x). In terms of it the eigenvalue 
equation reads 

(OT^M^^x) = vM{x)ij>{x), 
that due to the properties of the fixed point M(x) reduces to 

n<f>{g s {x)) = u(f>(x). 

This is solved by 

<t>p{ x ) = (- 1 ogx) /3 , 

with eigenvalue v„ = n l ~Pl a . A perturbation of the fixed point is unstable 
(or relevant, in the RG terminology) if the corresponding eigenvalue is greater 
than one, i.e. f3 < a and it is stable (irrelevant) if (3 > a. The case /3 = a 
consists of a perturbation tangent to the line of fixed points and, therefore it 
corresponds to a purely marginal direction. 

Note that the above analysis is consistent with the domains of attraction 
determined in Section [3l The stable directions are precisely those that do not 
alter the limit in ()3.2|) . the marginal ones induce an infinitesimal change in 
the limit and therefore also in the fixed point to which the perturbed distri- 
bution tends, and finally an unstable perturbation makes the limit diverge, 
implying that the perturbed distribution does not converge under successive 
applications of the RG transformation. 

To understand how the linear analysis above is useful to determine the 
finite size corrections, consider the following situation. We start with a ran- 
dom variable with cumulative distribution /i(x) expanded in the eigenvectors 
obtained above, 

/,(x) = M(x)(l + J><^0r)), (4.2) 

i 
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where the terms in the sum are ordered according to their eigenvalues, so that 
v f , > v r , for i < j. 

Pi Pj J 

Assuming that all eigenvalues are smaller than one (/% > a) or, in other 
words, that p(x) belongs to the domain of attraction of M(x), one can show 
that 

T s p(x) = M{x) + cm l ~^l a M{x)(t) h + o(n 1 - /3l / Q ). 

Hence, the largest eigenvalue determines the behavior with n (the size of the 
system) of the amplitude of the dominant correction while its eigenfunction 
determines the shape of the correction. One can also study corrections of 
higher order and go beyond the linear approximation. In the next section we 
show how to accomplish this and compare our approximations with numerical 
implementations of the statistical models to test their reliability. 



5 Numerical tests 

In order to test our theoretical predictions we make two numerical experiments 
in which we study the distribution for the maxima of n independent random 
variables with a probability density p. The actual size of the systems is chosen 
so that the perturbative approach discussed above applies and the experi- 
ment is repeated a number of times large enough to make the statistical error 
much smaller than the finite size corrections. The numerical simulations are 
performed by generating n independent random variables and selecting their 
maximum. We divide the interval into 50 bins and after repeating the ex- 
periment N times we obtain the frecuency with which the maximum belongs 
to a given bin. The frequency, properly normalized, will be our numerical 
approximation to p n = T s p, with n = e s . 

Our first example for p is the tent distribution, whose probability density 
is given by 

' Ax, x<l/2, 
4- 4x, x> 1/2. 



p(x) 



Observe that the support of p is the interval [0, 1]. It is plotted, together with 
the density of its limiting distribution, in Fig. [T] 

The cumulative distribution function determined by p is 




x < 1/2, 
x > 1/2, 



that converges under the action of the RG transformation for a = 2 to the 
cumulative distribution function 

M(x) = e- 2( - loga;)2 . 



0.0 0.2 0.4 0.6 0.8 1.0 

X 



Figure 1: Probability density function of (I5.1j) (solid line) and its limiting 
function (dashed line). 

If we perform the expansion in (j4.2|) we obtain 

19 9 

H{x) = M(x)(l + 2(-logx) 3 - — (-log x) A + -{- log xf + •••)• 

u 2 

The most relevant (or rather the least irrelevant) eigenvalue in the expansion 
is 1/3 = n" 1 ! 2 and it determines the behaviour with n of the amplitude of the 
finite-size corrections. In order to quantify the corrections when the number of 
random variables is n, we use the L 1 norm for the difference of the probability 
densities. This norm is also called total variation metric in the context of 
probability theory (see [T7] and references therein). We expand 

A:= f \p n {x) - M'(x)\dx = c 1/2 n^/ 2 + cm" 1 + c 3/2 n- 3 / 2 + ■ ■ ■ (5.2) 

JO 

where p n = T s p with n = e s . The first coefficient is given by 



Cl/2 



2 



o 



A[ M (x)log(x) 3 ] 



dx = ^V3e" 3 / 2 , 



and, similarly, one can compute the others to obtain 

ci = -fe" 3 / 2 , c 3/2 = |V3e- 3 / 2 ,... 

In Fig. [2] we have plotted the finite-size corrections to the distribution of 
the maxima obtained numerically scaled with y/n, for different values of the 
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0.58 



0.56 

< 

0.54 
0.52 

°- 50 'o' 500 1000 1500 2000 2500 3000 

n 

Figure 2: Finite size corrections rescaled with y/n as a function of the size of 
the system n. Dots represent the values obtained in the numerical experiment, 
with errobars corresponding to two standard deviations, and the solid line is 
the theoretical prediction for \/nA up to the n _1 term. 

size of the system n. We observe a very good agreement with the theoretical 
predictions in (|5.2|) . 

The second prediction that we test numerically is the shape of the cor- 
rections. In this case we take a fixed (and large) value for n and we plot 
the rescaled difference between the limiting distribution and the one obtained 
numerically for the maxima of n random variables distributed according to p. 
The finite-size corrections 5(x) := (p n (x) — M'(x)) can be expanded as 

S(x) = 5 1/2 (x)n~ 1/2 + <5i(z;)n -1 + • • • , 

with the first coefficients given by 

5 1/2 {x) = -2^-[M{x)\og{xf], 

A 1 q 

Si(x) = — [M(x)log(x) 4 (- -21og(x) 2 )]. (5.3) 

In Fig. 3 the dots represent the points obtained with the numerical exper- 
iment for y/n5(x) corresponding to n = 3000. The error bars, a little larger 
than the size of the dots, represent the statistical uncertainty due to the lim- 
ited size of the sample. The dashed line is <5 1 / 2 (^) as defined in (|5.3p while the 
solid line includes the next correction 5 1 / 2 (x) + 5i(x)n -1 / 2 . We see a wonderful 
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0.0 0.2 0.4 0.6 0.8 1.0 

X 



Figure 3: Shape corrections for n = 3000. Dots represent the outcome of the 
numerical experiment. The errorbars, representing two standard deviations, 
are of the order of the dot size in the plot. The dashed line is the leading term 
of the correction #1/2 fa) while the solid line includes the next subleading term 
5 1/2 (x)+5 1 (x)n- 1 / 2 . 



agreement between the theoretical prediction and the numerical experiment, 
especially when the subleading correction is included. 
The second example has a probability density 

p(x) = |2 - 4x|, 

and a cumulative distribution function 



H(x) 



2x(l-s), x<l/2, 
l-2x(l-x), x> 1/2. 



It converges under the action of the RG with a = 1 to Mfa) = x 2 . The 
limiting probability density is M'{x) = 2x. We can expand again, 

//fa) = Mfa)(l + {-logxf + (-logx) 3 + — (-logx) 4 + ---), 

and we find that the most relevant perturbation has an eigenvalue z/ 2 = n~ l , 
which determines the leading behavior with n of the amplitude of the finite-size 
corrections. If we also keep the first subleading terms we obtain 

A= f \p n (x) - M'(x)\dx = cin" 1 + c 2 n" 2 + c 3 n" 3 H (5.4) 

Jo 
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0.3 1 



0.30 



< 



0.27 




Figure 4: Finite size corrections in the second example scaled with n. In the 
x axis we represent the size of the system, n. Dots are the values obtained 
from the numerical experiment and the solid line is the theoretical prediction 
(|5.4p for nA up to the n~ 2 term. 



with 

ci = 2e~ 2 , c 2 = 3e" 2 , c 3 = -e~ 2 . 

In Fig. U]we check the validity of this expansion. We see that within the sta- 
tistical errors, due to the limited size of the sample, the finite-size corrections 
agree with the theoretical predictions. 

The shape of the corrections in this case is 

5(x) = 5i(x)n~ l + 5 2 (x)n~ 2 ^ (5.5) 

with the different contributions given by 

5\(x) = 2xlogx(l — logx), 

5 2 {x) = xlog(x) 2 (log(x) 2 - 3). (5.6) 

In Fig. [5]we show the numerical value for the shape correction and compare 
it with the analytical prediction in (I5.5p . We can see again a remarkable 
agreement between the numerical experiment and the theoretical prediction. 

In the previous examples we have tested the accuracy of the finite size 
analysis carried out in Section |U The size of the system and the sample have 
been chosen so that the computational time is reasonable and the errors are 
sufficiently small not to spoil any predictive power. Within this range we have 
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Figure 5: Shape corrections for the second example. Here n = 300. Dots 
represent the outcome of the numerical experiment. The error bars stand for 
two standar errors. The dashed line is the leading term of the correction, 
(5i(x), while the solid line includes the next subleading term, 5\(x) + 52(x)n~ 1 . 
The dashed line almost coincides with the solid line; only near the maximum 
the difference can be appreciated. 

to go beyond first order corrections (represented by the dashed line in Figs. [3] 
and [5]) to give a precise description of the experimental results. The number 
of terms needed depends, of course, on the concrete details of the problem and 
the required accuracy. 

6 Conclusions 

By employing Renormalization Group techniques we have studied the limit 
distribution of the appropriately rescaled maximum value of a sequence of n 
independent, identically distributed random variables when n — > oo. Obvi- 
ously, the rescaling is needed for obtaning non-trivial limits. To the present 
time only affine rescalings had been considered (perhaps in analogy to the 
treatment of the problem of stable distributions) without further justifica- 
tion. However, when studying limits of sequences of maxima of independent, 
identically distributed random variables, it seems natural to impose that the 
rescaling preserves the support of the original random variable. 

We have recast the problem of finding such limit distributions into the lan- 
guage of the Renormalization Group, explained how the condition of support 
preservation naturally arises, and what its implications are. The main result 
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of this paper is to show that when non-affine rescalings are allowed new inter- 
esting limit distributions are obtained that do not belong to the well-known 
Gumbel, Weibull, or Frechet families. In this formulation the limit distribu- 
tions are fixed points of the Renormalization Group transformation. After the 
identification and discussion of the fixed points we have studied the differential 
of the transformation around them (with emphasis on the new ones) in order 
to understand the domains of attraction and the corrections due to large but 
finite n, the so-called finite-size corrections. 

An interesting technical aspect of the approach herein adopted is the con- 
crete form of the definition of the Renormalization Group transformation. We 
define it as an uniparametric group of transformations that is fixed once for 
all, differing from other works in this line where the transformation can be 
adapted at every step. This fact has some consequences, especially concerning 
the domain of attraction of the fixed points. Indeed, within our approach the 
determination of the domain of attraction is simpler. 

Finally, one may wonder whether it is possible to modify the classical 
results on the domains of attraction when we restrict to transformations that 
preserve the support of the original random variable. 
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